AmpliconDuo: A Split-Sample Filtering Protocol for High-Throughput Amplicon Sequencing of Microbial Communities.

نویسندگان

  • Anja Lange
  • Steffen Jost
  • Dominik Heider
  • Christina Bock
  • Bettina Budeus
  • Elmar Schilling
  • Axel Strittmatter
  • Jens Boenigk
  • Daniel Hoffmann
چکیده

High throughput sequencing (HTSeq) of small ribosomal subunit amplicons has the potential for a comprehensive characterization of microbial community compositions, down to rare species. However, the error-prone nature of the multi-step experimental process requires that the resulting raw sequences are subjected to quality control procedures. These procedures often involve an abundance cutoff for rare sequences or clustering of sequences, both of which limit genetic resolution. Here we propose a simple experimental protocol that retains the high genetic resolution granted by HTSeq methods while effectively removing many low abundance sequences that are likely due to PCR and sequencing errors. According to this protocol, we split samples and submit both halves to independent PCR and sequencing runs. The resulting sequence data is graphically and quantitatively characterized by the discordance between the two experimental branches, allowing for a quick identification of problematic samples. Further, we discard sequences that are not found in both branches ("AmpliconDuo filter"). We show that the majority of sequences removed in this way, mostly low abundance but also some higher abundance sequences, show features expected from random modifications of true sequences as introduced by PCR and sequencing errors. On the other hand, the filter retains many low abundance sequences observed in both branches and thus provides a more reliable census of the rare biosphere. We find that the AmpliconDuo filter increases biological resolution as it increases apparent community similarity between biologically similar communities, while it does not affect apparent community similarities between biologically dissimilar communities. The filter does not distort overall apparent community compositions. Finally, we quantitatively explain the effect of the AmpliconDuo filter by a simple mathematical model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

BACKGROUND Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. METHODS We present a triple-i...

متن کامل

A novel conceptual approach to read-filtering in high-throughput amplicon sequencing studies

Adequate read filtering is critical when processing high-throughput data in marker-gene-based studies. Sequencing errors can cause the mis-clustering of otherwise similar reads, artificially increasing the number of retrieved Operational Taxonomic Units (OTUs) and therefore leading to the overestimation of microbial diversity. Sequencing errors will also result in OTUs that are not accurate rec...

متن کامل

High-throughput amplicon sequencing reveals distinct communities within a corroding concrete sewer system.

Microbially induced concrete corrosion (MICC) is an important problem in sewers. Here, small-subunit (SSU) rRNA gene amplicon pyrosequencing was used to characterize MICC communities. Microbial community composition differed between wall- and ceiling-associated MICC layers. Acidithiobacillus spp. were present at low abundances, and the communities were dominated by other sulfur-oxidizing-associ...

متن کامل

Seasonal variations of microbial community in a full scale oil field produced water treatment plant

This study investigated the microbial community in a full scale anaerobic baffled reactor and sequencing batch reactor system for oil-produced water treatment in summer and winter. The community structures of fungi and bacteria were analyzed through polymerase chain reaction–denaturing gradient gel electrophoresis and Illumina high-throughput sequencing, respectively. Chemical oxygen demand eff...

متن کامل

Species Identification and Profiling of Complex Microbial Communities Using Shotgun Illumina Sequencing of 16S rRNA Amplicon Sequences

The high throughput and cost-effectiveness afforded by short-read sequencing technologies, in principle, enable researchers to perform 16S rRNA profiling of complex microbial communities at unprecedented depth and resolution. Existing Illumina sequencing protocols are, however, limited by the fraction of the 16S rRNA gene that is interrogated and therefore limit the resolution and quality of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PloS one

دوره 10 11  شماره 

صفحات  -

تاریخ انتشار 2015